Entropy-Based Dynamic Rescoring with Language Model in E2E ASR Systems

Abstract

Language models (LMs) have played crucial roles in automatic speech recognition (ASR), whether as an essential part of a conventional ASR system composed of an acoustic model and an LM, or integrated to enhance the performance of novel end-to-end systems. With the development of machine learning and deep learning, language modeling has made great progress in natural language processing applications. In recent years, efforts have been made to leverage the advantages of LMs in ASR. The most common way to apply this integration is still shallow fusion, because it can be implemented easily with zero overhead while obtaining significant improvement. Our method further improves applicability without hyperparameter tuning while maintaining similar performance.
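For readers unfamiliar with shallow fusion, the following is a minimal sketch of the general technique as it is commonly described, not the method proposed in the paper: each hypothesis is scored by adding the ASR log-probability to the LM log-probability scaled by a fixed weight. The function names, the `lm_weight` value of 0.3, and the example scores are all assumptions chosen for illustration.

```python
# Minimal sketch of shallow fusion over an N-best list.
# All names and numbers here are illustrative assumptions, not from the paper.

def shallow_fusion_score(asr_logp: float, lm_logp: float, lm_weight: float) -> float:
    """Return log p_ASR(y|x) + lm_weight * log p_LM(y)."""
    return asr_logp + lm_weight * lm_logp


def rescore(hypotheses, lm_weight: float = 0.3) -> str:
    """Pick the hypothesis text with the highest fused score.

    `hypotheses` is a list of (text, asr_logp, lm_logp) tuples.
    """
    best = max(
        hypotheses,
        key=lambda h: shallow_fusion_score(h[1], h[2], lm_weight),
    )
    return best[0]


# Hypothetical N-best list: (text, ASR log-prob, LM log-prob).
EXAMPLE_NBEST = [
    ("recognize speech", -4.0, -6.0),
    ("wreck a nice beach", -3.8, -12.0),
]
```

In this sketch `lm_weight` is a fixed value that would normally be tuned on held-out data, which is the kind of hyperparameter tuning the abstract says the proposed method avoids.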

Related resources

An Efficient Approach of Language Model Applying in ASR Systems

The language model plays a pivotal role in large-vocabulary speech recognition systems. By providing more syntactic and semantic information, high-level language models are better able to guide the search process and hence optimize the final result. On the other hand, complex language models, compared with simple ones, usually introduce a proportional computing workload that jeopardizes th...

ASR-based systems for language learning and therapy

ASR-based CALL seems to offer many possibilities for language learning and therapy. However, in both domains the speech of the users generally differs substantially from standard speech. ASR of such atypical speech is complex and challenging. Furthermore, developing successful CALL systems requires a mix of expertise. This combination of factors has led to misconceptions and pessimism on the us...

Attacking Paper-Based E2E Voting Systems

In this paper, we develop methods for constructing vote-buying/coercion attacks on end-to-end voting systems, and describe vote-buying/coercion attacks on three proposed end-to-end voting systems: Punchscan, Prêt-à-voter, and ThreeBallot. We also demonstrate a different attack on Punchscan, which could permit corrupt election officials to change votes without detection in some cases. Additionall...

Simulation-based analysis of E2E voting systems

End-to-end auditable voting systems are expected to guarantee very interesting, and often sophisticated, security properties, including correctness, privacy, fairness, receipt-freeness, and others. However, for many well-known protocols, these properties have never been analyzed in a systematic way. In this paper, we investigate the use of techniques from the simulation-based security tradition for th...

Fuzzy class rescoring: a part-of-speech language model

Current speech recognition systems usually use word-based trigram language models. More elaborate models are applied to word lattices or N-best lists in a rescoring pass following the acoustic decoding process. In this paper, we consider techniques for dealing with class-based language models in the lattice rescoring framework of our JANUS large-vocabulary speech recognizer. We demonstrate how t...

Journal

Journal title: Applied Sciences

Year: 2022

ISSN: 2076-3417

DOI: https://doi.org/10.3390/app12199690